An extended formalism for preferential attachment 
in heterogeneous complex networks 
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In this paper we present a framework for the extension of the preferential attachment (PA) model 
to heterogeneous complex networks. We define a class of heterogeneous PA models, where node 
properties are described by fixed states in an arbitrary metric space, and introduce an affinity 
function that biases the attachment probabilities of links. We perform an analytical study of the 
stationary degree distributions in heterogeneous PA networks. We show that their degree densities 
exhibit a richer scaling behavior than their homogeneous counterparts, and that the power law 
scaling in the degree distribution is robust in presence of heterogeneity. 
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A complex network is a set of nodes and links with a 
non-trivial topology In the current effort to achieve 
a single coherent framework for complex systems, net- 
work theory has focused on the underlying principles that 
govern their topology 0, H[ • Dynamical network models 
4] are stochastic discrete-time dynamical systems that 
evolve networks by the iterated addition/subtraction of 
nodes/links. These models regard network topology as 
an emergent property of the network evolution, focus- 
ing on the mechanisms that take place on such process. 
Among these mechanisms, preferential attachment (PA) 
Q enjoys a foremost position in network literature. 

The PA model by Barabasi and Albert @ has provided 
a minimal account of mechanisms for the emergence of 
scale-free networks 0, [§[ . Such networks are character- 
ized by a degree distribution according to a power law, 
P(k) = k~ 7 , which leads to a non- negligible presence of 
hubs (highly connected nodes). The PA model assumes 
two mechanisms: growth and preferential attachment. 
The process starts with a seed and a new node is added 
to the network at each step. Each new node has a number 
m of links attached, which are connected to the existing 
nodes following the so-called attachment rule: the link- 
ing probability of a network node vi is proportional to its 
degree fcj, II(uj) = ki/J^j kj- This step is iterated until 
a number N of nodes have been added. 

The PA model is strictly topological as the node de- 
grees {ki} are the only metrics that drive network evo- 
lution. Nevertheless, the assumptions implicit in the PA 
model are not valid for a wide class of complex systems. 
Often, interactions between individual elements are me- 
diated by their intrinsic properties. Although network 
theory has led to a significant improvement in our under- 
standing of complex systems, it has been argued that its 
framework should be augmented in order to improve our 



modelling of complexity [l], Q . We will refer to hetero- 
geneous networks as networks where node intrinsic prop- 
erties induce affinities in their interactions. We consider 
such networks as a logical first step in addressing the 
complication introduced by the influence of individual 
elements on the network structure. 

In recent years several dynamical models have incorpo- 
rated the influence of eleme nt p roperties. These include 



weighted networks 
competition models 
[ttI ]. a metric model 
work automata [2011. 
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social models [H, [T3 |. 
16j, a local knowledge model 
a gas-like model and net- 
In this letter we propose a gen- 
eral class of heterogeneous PA models where a function 
measures the affinity between node intrinsic properties 
(states) and biases the attachment probabilities of links 
in a generalized PA rule. The introduction of affinities 
aims to provide a more realistic modelling of the struc- 
ture of numerous real systems that exhibit biased inter- 
actions. The proposed class contains previous models as 
particular cases, and provides a framework for the sys- 
tematic analysis of the influence of heterogeneity in PA 
networks. 

The PA model can be easily generalized to hetero- 
geneous networks by imposing a metric structure on 
the node states, while preserving the basic mechanisms 
of growth and preferential attachment of the original 
Barabasi-Albert model. Next we formally define a gen- 
eral class of heterogeneous PA models: 

Definition 1 A heterogeneous PA model with global 
affinity Mi is a 3-tuple {R,p,&), where: (1) R is an 
arbitrary metric space. The elements x € R are the node 
states. (2) p : R >— > K + U {0} is a nonnegative function 
with unit measure over R referred as node state distri- 
bution. (3) a : R 2 i— > M + U{0} is a nonnegative function 
referred as affinity of the interactions. 

The class of heterogeneous models with global affinity 
(HPAg) is the set of all 3-tuples that satisfy the condi- 
tions in Definition 1. This formalism defines the evolu- 
tion of a network according to the following rules: 
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(i) The nodes Vi are characterized by their state x, £ 
R. The node states describe intrinsic properties deemed 
constant in the timescale of evolution of the network. 

(ii) The growth process starts with a seed composed by 
iVo nodes (with arbitrary states X{ € R) and L links. 

(iii) A new node v a (with m links attached) is added to 
the network at each iteration. The number m is common 
for all the added nodes and remains constant during the 
evolution of the network. The newly added node is ran- 
domly assigned a state x following the distribution p(x). 

(iv) The to links attached to v a are randomly connected 
to the network nodes following a distribution {Il(vi)} 
given by an extended attachment rule, 



U(vi 



tt(v 4 ) = k t ■ a(xi,x a ). (1) 



The visibility ir of a node i>i in the attachment rule is 
given by the product of its degree h t and its affinity a 
with the newly added node v a , which is itself a function 
of the states x% and x a . It thus can be seen that for each 
interaction a biases the degree fc, of the candidate node. 
Steps (iii) and (iv) are iterated until a desired number 
of nodes has been added to the network. To sum up, 
the choice of the tuple (R, p, a) determines the form of 
heterogeneity in the attachment mechanism. 

Next we derive an analytical solution for the stationary 
degree distribution P(k) of the class HPA g of heteroge- 
neous models. The solution is obtained by rate equations 
which establish a balance in the flows of degree densities 
over a partition of a growing network. Let us first define a 
sequence of functions {f(k, x, iV)} jv>o which measure the 
probability density of a randomly chosen node having de- 
gree fceN and state x e R in a network at the iteration 
t — N. The degree densities are local metrics, thus they 
uniformly converge when N — > oo to a stationary density 
function f(k,x). Finally, the stationary degree distribu- 
tion P(k) measures the probability of a randomly chosen 
node having degree k in the thermodynamic limit. 

The rationale for the choice of / in modelling the evolu- 
tion of the degrees is simple: in a heterogeneous model, 
nodes with equal degree may contribute differently to 
changes in P(k, N) according to their state, despite being 
homogeneously accounted by P. Thus, for each (k, N) we 
need to know the influence of x on the degree distribu- 
tion. We will denote by V(x) = {vi,Xi = x] the subset 
of nodes in the network with state x. Assuming that the 
assignation of states is uncorrelated with the topology 
of the growing network, and that there are no linking 
events between existing nodes, the sequence {/} can be 
modelled on each V{x). For each x, the form of the equa- 
tion will be L\ — L 2 = Ri — R2, where: 
L\ = density of nodes with degree k at t = N + 1; 
L2 = density of nodes with degree k at t — N; 
Ri = increase in density due to nodes with degree k — 1 
that have gained a link at t = N; 

i?2 = decrease in density due to nodes with degree k that 
have gained a link at t = N. 



The total density of nodes with state x and degree k 
at t = N + 1 is (N + l)f(k, x,N + 1). Here we assume 
that N is large and no incoming node is rejected due 
to lack of affinity, therefore we approximate the network 
size by t. Likewise, the total density of nodes with state 
x and degree k at t — N is Nf(k,x,N). In order to 
estimate the changes in / due to the adquisition of new 
links, we introduce a mean-field fitness factor w{x) that 
measures the average affinity of a network node with state 
x towards the incoming nodes, 



w(x) = I <r(x,y)p(y)dy. 

JR 



(2) 



The probability of a node being chosen for attachment is 
proportional to kw(x), and the decrease in the density 
of nodes with state x and degree k due to their attach- 
ment of a new link is proportional to k f(k,x,N)w(x). 
This term must be normalized by the increase in all the 
densities (irrespective of their degree k and state x) in 
the network at t = N, measured by a partition factor 



Vfc / f(k,x,N)w{x)dx. 

JR 



(3) 



As in each iteration there are to links attached to a newly 
added node, the decrease in the total density of nodes 
with state x and degree k due to nodes with degree k 
gaining a link and being promoted to degree k + 1 is 
mkf(k,x,N)w(x)/tp(N) at each step. Here we assume 
that for large N the probability that a single node re- 
ceives more than one link attached to the same new node 
becomes negligible. Likewise, the increase in the total 
density of nodes with state x and degree k due to nodes 
with state x and degree k— 1 gaining a link and being pro- 
moted to degree k is m(k — l),f(k — 1, x, N)w(x)/i(>(N). 
The resulting density rate equation for k > to is: 

(N + l)f(k, x,N +1) — Nf(k, x, N) = 
mw(x) . 



■[(k-l)f(k-l,x,N)-kf(k,x,N)]. (4) 



When k = m, there is no increase in /(to, x, N) due to 
the promotion of nodes with k = m — 1. However, each 
newly added node has degree m and this contribution to 
/(to, x, N) is distributed according to p(x), thus: 

(N + l)/(m, x, N + 1) - Nf(m, x, N) = 
mw(x) 

= p(x)~ ^ mf(m,x,N). (5) 

There are no nodes with degree k < to, since when 
N — > 00 all the links attached to newly added nodes find 
receptive nodes in the network, therefore the previous 
equations define all the possible cases in each iteration. 

In the thermodynamic limit N — > 00, f(k, x, N + 1) = 
f(k,x,N) — f(k,x) and the rate equations become: 

f(k,x)= (6) 
f ^[(k-l)f(k-l,x)-kf(k,x)} forfc>TO, 
I p(x) — mw ( x ) mf(k, x) for k = to, 
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These equations are coupled by the stationary parti- 
tion factor ip = J2 k k J f(k,x)w(x)dx. Notice that in 
the homogeneous PA model, w(x) — lVx G R and thus 
ip = ^2ukP(k), which is equal to the average network 
degree z = (2Nm + 2L )/(N + N ). In the stationary 
limit N ^ N and Nm > L , thus ip ~ 2m. 

To the contrary, in the heterogeneous models w(x) con- 
tinuously weighs the density /(fc, x) in the integral asso- 
ciated to each degree k in the stationary partition factor 
ip. In order to decouple Eq. let us assume that w(x) 
doesn't change too much on R so that we may approxi- 
mate w(x) by its mean w = J R w(x)p(x)dx, then 

k k 

The resulting decoupled system becomes: 

f(k,x)= (8) 
\ i^f) K fc - - 1.^) - kf{k, x)] for k > m, 

P{x)-\{^)mf{k,x) 



for k = m, 



We introduce a normalized fitness w — w(x)/w, which 
measures the fitness of a state relative to the average 
fitness of the network nodes. Solving Eq. [8] we obtain 



f(m,x) 



2p 



wm + 2 ' 



(9) 



/(M) = W -f , ir /( fc " i'Z) for k > m - 



wk + 2 

Thus the solution for the stationary density for k > m is 

fc 



mx)= n 



w(j - 1) 2p 



i j — m + 1 



(10) 



and integrating the density in Eq. rTUlover the state space 
R the stationary degree distribution is 



P(k) 



n 



w(j - 1) 



2p 



-dx. 



(11) 



It is important to remark that the solutions in Eqs. [TU] 
and [11] are valid for any model in our class, since they 
do not make any assumption regarding the geometry of 
space R, the form of affinity a or the distribution p. 

Furthermore we may assume that w(x) > Vx G R 
without loss of generality, otherwise a(xo, y) = Vy € R 
since a > 0, which would mean that xq is "inert" and 
may be excluded from R. Then w(x) > Vx G i? and we 
may write the density in Eq. [10] for fc > m as 



f(k,x) 



2p/w B(k,l + 2/w) 
m + 2/w B(m, 1 + 2/w) ' 



(12) 



where Legendre's Beta function B(y,z) = L t v 1 (1 — 
t) z ~ 1 dt for y, z > satisfies the functional relation 
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FIG. 1: Solution for the stationary density f(k,w) in hetero- 
geneous PA models (Eq. I10[) with p(x) — 1 and m = 3. 



r(a)/r(a + 6) = B(a,b)/Y{b) for Euler's Gamma V func- 
tion. Likewise, integrating the density in Eq. [12] we 
obtain for fc > m the stationary degree distribution 

Jr m + 2/wB(m, 1 + 2/w) v ; 

Notice that Eq. [T2l reduces to /(m,x) in Eq. [9] when 
fc = m, thus Eqs. [T2l and [T3l are valid for fc > m. Also, 
notice that the stationary densities /(fc, x) in the hetero- 
geneous formalism, as given by Eq. 1121 follow the form 
of a Beta function with arguments (fc, 1 + 2/w). 

In particular cases it is possible to obtain simpler ex- 
pressions for /(fc,x). First, in the homogeneous case, 
w(x) = 1, thus w = w = 1 and Eq. ITD1 yields 



f(k,x) 



2pm(m + 1) 
fc(fc + l)(fc + 2)' 



(14) 



and the density functions /(fc,x) asymptotically follow 
power laws /(fc,x) ~ fc -7 with exponent 7 = 3 irre- 
spective of the node states. This is reasonable, as the 
dynamics of the homogeneous PA network is not affected 
by them, thus the asymptotic behavior of all density com- 
ponents is the same. If furthermore we assume that p{x) 
is uniform over R, then any subset V(x) of nodes will ex- 
hibit the same distribution as the whole network. Like- 
wise, the degree distribution P(k) asymptotically follows 
a power law with exponent 7 = 3 like the Barabasi- Albert 
model [8], irrespective of the distribution p. 

In presence of heterogeneity the fitness w(x) is gener- 
ally not constant over R. It is still possible to obtain 
simple forms of /(fc,x) from Eq. [TU] for discrete values 
of w that render the denominator a product of integers. 
For instance, let x be such that w — 2 > 1, then 



f(k,x) 



pm 



fc(fc + l)' 



(15) 
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which asymptotically behaves as a power law with ex- 
ponent 7 = 2. Conversely, letting x be such that 
w = 2/3 < 1, then Eq. HQ] yields 



f(k,x) 



3pm(m + l)(m + 2) 
fc(fc + l)(fc + 2)(fc + 3)' 



(16) 



which asymptotically behaves as a power law with ex- 
ponent 7 = 4. By extension, normalized fitness values 
w = 2/n with integer n yield density functions f(k,x) 
with an asymptotic power law behavior with 7 = 1+2/w. 

More generally, it can be proved that the Beta func- 
tion behaves as B{%), z) ~ y -2 when y — > 00. There- 
fore, Eq. [T2l implies that f(k,x) behaves asymptotically 
as B(k, 1 + 2/w) ~ fc~( 1+2 / 1 "). The density components 
/(A:, a;) in heterogeneous PA networks exhibit a multiscal- 
ing according to power laws k" 1 ^ along the continuous 
spectrum of normalized fitness w. The scaling exponents 
are inversely related to w according to 7(u>) = 1 + 2/w 
so that they span themselves a continuum. 

Thus, as w increases (resp. decreases), the exponent 
7 of / decreases (resp. increases). Nodes with states 
more fit than the average (w > 1) adopt densities with 
7 < 3, which exhibit a slower asymptotical decay, and 
tend to produce more hubs. Nodes with states less fit 
than the average (w < 1) adopt densities with 7 > 3, 
which exhibit a faster asymptotical decay. These results 
are verified in Fig. [T] which shows the plot of the den- 
sity /. Although the shape of / depends on the particular 
heterogeneity of the network, its asymptotic behavior can 
be characterized in a general way by plotting the depen- 
dence of / on w(x) instead of x. 

The behavior described points out to a signature of 
heterogeneity in the topology of PA networks: the fit- 
ness w(x) will tend to fluctuate on R and the density 
components f(k,x) will exhibit different scaling expo- 
nents. This prompts us to suggest a procedure for em- 
pirically detecting such signature in real networks. In 
order to check the role of a certain degree of freedom in 
the network evolution, one could select different states 
{x} varying only in such dimension, select subsets of 
nodes Vg(x) — {vi, d(x^ — x) < 6} with states in a small 
neighborhood of the states x. If the partial distributions 
{P(k\x)} exhibit power laws with different exponents, it 
would be feasible to assume this dimension (or another 
one correlated with it) is biasing the attachment mech- 
anism. To check the existence of heterogeneity it would 
be enough to find one such subset of nodes whose par- 
tial distribution {P(k\x)} exhibits a power-law with an 
exponent different to the one in P{k). 

As for the degree distribution P(k) of the heteroge- 
neous models, Eq. QTJ it is obtained by the integration of 
the density components which form a set of power-laws 
with varying exponents. The level of relative variability 
present in the fitness w(x) over the state space R will 



determine the ultimate composition of P(k). Choices of 
(R,p,a) that yield little fluctuations in w(x) on R will 
translate into density functions f(k,x) deviating slightly 
from the homogeneous power law, and degree distribu- 
tions similar to the Barabasi-Albert case. On the other 
hand, choices of (i?, p, a) that yield larger fluctuations in 
w(x) will translate into a wider spectrum of exponents 7 
in the density components, and thus a larger deviation 
of the degree distribution from the homogeneous case. 

While we need to know the tuple (R, p, a) to obtain 
the distribution P{k), it is still possible to make rough 
predictions about its behavior in heterogeneous PA net- 
works. We may regard the degree distribution in such 
networks as a superposition of power laws whose expo- 
nents are distributed around the homogeneous 7 = 3. 
For low degrees it can be expected that averaging low 
and high 7 components will tend to produce a behav- 
ior of P(k) similar to the one observed in homogeneous 
networks. When k — > 00, the tail of P(k) will be domi- 
nated by the slowest decaying components, associated to 
states with highest w. Furthermore, given that w will 
be distributed by definition around 1, the P(k) of all the 
models in the class will exhibit scaling exponents satis- 
fying 1 < 7 < 3. This is remarkable since the empirical 
observations in real scale-free networks show that most 
exponents fit within this interval. 

In summary, we have proposed a generalization of the 
PA model to heterogeneous networks. We have defined 
a general class of heterogeneous PA models where node 
properties are described by fixed states in an arbitrary 
space, and where an affinity function biases the attach- 
ment probabilities of links. We have obtained analytical 
solutions for the degree densities and degree distribution 
of heterogeneous networks in the stationary limit. We 
have shown that these networks exhibit a richer scal- 
ing behavior than their homogeneous counterparts, a 
multiscaling of their degree densities according to power 
laws with exponents spanning a continuum. Such phe- 
nomenon leaves a signature of heterogeneity in the topol- 
ogy of PA networks that can be empirically checked in 
real networks. We have also shown the relationship be- 
tween the variability in the exponents and the fluctua- 
tions in the fitness over the state space, and the conse- 
quences of this on the asymptotic behavior of the degree 
distribution. Finally we have shown that the degree dis- 
tributions of all heterogeneous models in our class exhibit 
power laws with exponents within the limits empirically 
observed in real networks. These results suggest that the 
ubiquity of real scale-free networks with otherwise differ- 
ent structural properties may be partially explained by 
PA mechanisms with varying levels of heterogeneity. 
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